Overview

Dataset Statistics

Number of Variables 13
Number of Rows 2803
Missing Cells 0
Missing Cells (%) 0.0%
Duplicate Rows 0
Duplicate Rows (%) 0.0%
Total Size in Memory 435.4 KB
Average Row Size in Memory 159.0 B
Variable Types
  • Numerical: 12
  • Categorical: 1

Dataset Insights

Solidity is skewed Skewed
Compactness is skewed Skewed

Variables


Length

numerical

Approximate Distinct Count 1944
Approximate Unique (%) 69.3%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 44848
Mean 289.9264
Minimum 151.3353
Maximum 515.3525
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Length is skewed right (γ1 = 0.6365)

Quantile Statistics

Minimum 151.3353
5-th Percentile 203.6737
Q1 245.026
Median 279.8779
Q3 329.1509
95-th Percentile 411.3528
Maximum 515.3525
Range 364.0172
IQR 84.1249

Descriptive Statistics

Mean 289.9264
Standard Deviation 62.1812
Variance 3866.498
Sum 812663.6361
Skewness 0.6365
Kurtosis 0.0661
Coefficient of Variation 0.2145
  • Length has 21 outliers

Width

numerical

Approximate Distinct Count 1859
Approximate Unique (%) 66.3%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 44848
Mean 170.8074
Minimum 88.0505
Maximum 258.5698
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Width is skewed right (γ1 = 0.1955)

Quantile Statistics

Minimum 88.0505
5-th Percentile 126.3836
Q1 149.5894
Median 169.9241
Q3 190.6404
95-th Percentile 224.5966
Maximum 258.5698
Range 170.5193
IQR 41.051

Descriptive Statistics

Mean 170.8074
Standard Deviation 29.5879
Variance 875.4447
Sum 478773.1511
Skewness 0.1955
Kurtosis -0.0521
Coefficient of Variation 0.1732
  • Width is not normally distributed (p-value 6.050152904181807e-10)
  • Width has 9 outliers

Thickness

numerical

Approximate Distinct Count 1797
Approximate Unique (%) 64.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 44848
Mean 109.7924
Minimum 59.4943
Maximum 181.8452
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Thickness is skewed right (γ1 = 0.1146)

Quantile Statistics

Minimum 59.4943
5-th Percentile 78.1972
Q1 97.349
Median 110.4463
Q3 121.5958
95-th Percentile 140.165
Maximum 181.8452
Range 122.3509
IQR 24.2469

Descriptive Statistics

Mean 109.7924
Standard Deviation 18.9462
Variance 358.9581
Sum 307748.226
Skewness 0.1146
Kurtosis 0.2239
Coefficient of Variation 0.1726
  • Thickness has 33 outliers

Area

numerical

Approximate Distinct Count 2750
Approximate Unique (%) 98.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 44848
Mean 26511.1174
Minimum 6037
Maximum 89282
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Area is skewed right (γ1 = 1.2418)

Quantile Statistics

Minimum 6037
5-th Percentile 10611.25
Q1 16211.5
Median 23440.5
Q3 33451
95-th Percentile 54367.3
Maximum 89282
Range 83245
IQR 17239.5

Descriptive Statistics

Mean 26511.1174
Standard Deviation 13782.5613
Variance 1.8996e+08
Sum 7.4311e+07
Skewness 1.2418
Kurtosis 1.6223
Coefficient of Variation 0.5199
  • Area has 87 outliers

Perimeter

numerical

Approximate Distinct Count 2793
Approximate Unique (%) 99.6%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 44848
Mean 743.8638
Minimum 311.5635
Maximum 1864.9474
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Perimeter is skewed right (γ1 = 0.9186)

Quantile Statistics

Minimum 311.5635
5-th Percentile 436.0526
Q1 571.73
Median 707.4874
Q3 878.8965
95-th Percentile 1151.6952
Maximum 1864.9474
Range 1553.3839
IQR 307.1665

Descriptive Statistics

Mean 743.8638
Standard Deviation 230.6321
Variance 53191.1545
Sum 2.0851e+06
Skewness 0.9186
Kurtosis 1.1256
Coefficient of Variation 0.31
  • Perimeter is not normally distributed (p-value 0.004605333863233486)
  • Perimeter has 61 outliers

Roundness

numerical

Approximate Distinct Count 1944
Approximate Unique (%) 69.3%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 44848
Mean 0.4695
Minimum 0.1737
Maximum 0.6973
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Roundness is skewed left (γ1 = -0.3003)

Quantile Statistics

Minimum 0.1737
5-th Percentile 0.2548
Q1 0.3836
Median 0.4714
Q3 0.5773
95-th Percentile 0.6384
Maximum 0.6973
Range 0.5235
IQR 0.1937

Descriptive Statistics

Mean 0.4695
Standard Deviation 0.1187
Variance 0.01409
Sum 1316.0808
Skewness -0.3003
Kurtosis -0.8742
Coefficient of Variation 0.2528

Solidity

numerical

Approximate Distinct Count 2800
Approximate Unique (%) 99.9%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 44848
Mean 0.9558
Minimum 0.7188
Maximum 0.9929
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Solidity is skewed left (γ1 = -2.1738)

Quantile Statistics

Minimum 0.7188
5-th Percentile 0.8706
Q1 0.9446
Median 0.9704
Q3 0.9815
95-th Percentile 0.9886
Maximum 0.9929
Range 0.2741
IQR 0.0369

Descriptive Statistics

Mean 0.9558
Standard Deviation 0.0396
Variance 0.001568
Sum 2679.1861
Skewness -2.1738
Kurtosis 5.4483
Coefficient of Variation 0.04143
  • Solidity is not normally distributed (p-value 3.7885792066340233e-10)
  • Solidity has 208 outliers

Compactness

numerical

Approximate Distinct Count 2800
Approximate Unique (%) 99.9%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 44848
Mean 1.8252
Minimum 1.1645
Maximum 9.6601
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Compactness is skewed right (γ1 = 3.2164)

Quantile Statistics

Minimum 1.1645
5-th Percentile 1.2486
Q1 1.3574
Median 1.5764
Q3 1.966
95-th Percentile 3.3776
Maximum 9.6601
Range 8.4956
IQR 0.6086

Descriptive Statistics

Mean 1.8252
Standard Deviation 0.7941
Variance 0.6305
Sum 5116.1295
Skewness 3.2164
Kurtosis 14.8586
Coefficient of Variation 0.435
  • Compactness is not normally distributed (p-value 1.385859124498169e-12)
  • Compactness has 209 outliers

Aspect_Ratio

numerical

Approximate Distinct Count 1003
Approximate Unique (%) 35.8%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 44848
Mean 1.7543
Minimum 1.4001
Maximum 2.7313
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Aspect_Ratio is skewed right (γ1 = 1.302)

Quantile Statistics

Minimum 1.4001
5-th Percentile 1.5184
Q1 1.6153
Median 1.7059
Q3 1.8386
95-th Percentile 2.154
Maximum 2.7313
Range 1.3312
IQR 0.2233

Descriptive Statistics

Mean 1.7543
Standard Deviation 0.2039
Variance 0.04159
Sum 4917.1774
Skewness 1.302
Kurtosis 2.0479
Coefficient of Variation 0.1163
  • Aspect_Ratio is not normally distributed (p-value 0.002387941199396806)
  • Aspect_Ratio has 130 outliers

Eccentricity

numerical

Approximate Distinct Count 1003
Approximate Unique (%) 35.8%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 44848
Mean 0.8136
Minimum 0.6999
Maximum 0.9306
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Eccentricity is skewed right (γ1 = 0.2977)

Quantile Statistics

Minimum 0.6999
5-th Percentile 0.7525
Q1 0.7846
Median 0.8103
Q3 0.8397
95-th Percentile 0.8879
Maximum 0.9306
Range 0.2307
IQR 0.05506

Descriptive Statistics

Mean 0.8136
Standard Deviation 0.0413
Variance 0.001706
Sum 2280.5544
Skewness 0.2977
Kurtosis -0.3771
Coefficient of Variation 0.05076
  • Eccentricity has 11 outliers

Extent

numerical

Approximate Distinct Count 2800
Approximate Unique (%) 99.9%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 44848
Mean 0.7246
Minimum 0.4545
Maximum 0.8458
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Extent is skewed left (γ1 = -1.3322)

Quantile Statistics

Minimum 0.4545
5-th Percentile 0.6306
Q1 0.7017
Median 0.7337
Q3 0.7576
95-th Percentile 0.7818
Maximum 0.8458
Range 0.3913
IQR 0.05588

Descriptive Statistics

Mean 0.7246
Standard Deviation 0.04747
Variance 0.002254
Sum 2031.0168
Skewness -1.3322
Kurtosis 2.6727
Coefficient of Variation 0.06552
  • Extent is not normally distributed (p-value 0.0011969842898268228)
  • Extent has 112 outliers

Convex_Area

numerical

Approximate Distinct Count 2737
Approximate Unique (%) 97.7%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 44848
Mean 27696.2182
Minimum 6355
Maximum 90642.5
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Convex_Area is skewed right (γ1 = 1.2294)

Quantile Statistics

Minimum 6355
5-th Percentile 11025.95
Q1 17088.5
Median 24589
Q3 34863.25
95-th Percentile 56392.8
Maximum 90642.5
Range 84287.5
IQR 17774.75

Descriptive Statistics

Mean 27696.2182
Standard Deviation 14237.3476
Variance 2.027e+08
Sum 7.7632e+07
Skewness 1.2294
Kurtosis 1.6016
Coefficient of Variation 0.5141
  • Convex_Area has 89 outliers

Type

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 199007

Length

Mean 5.9979
Standard Deviation 0.8147
Median 6
Minimum 5
Maximum 7

Sample

1st row MAMRA
2nd row MAMRA
3rd row MAMRA
4th row MAMRA
5th row MAMRA

Letter

Count 16812
Lowercase Letter 0
Space Separator 0
Uppercase Letter 16812
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (SANORA, MAMRA) take over 50.0%

Interactions

Correlations

Missing Values